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Background of the Invention 
Interest in the substrate specificity of bacterial dioxygenases 
stems from initial studies on the degradation of benzene and toluene more than 
25 years ago. A mutant strain of Pseudomonas putida (strain F39/D) was shown 

1 5 to oxidize benzene and toluene to cis-\ ,2-dihydroxycyclohexa-3,5-diene (cis- 
benzene dihydrodiol) and cw-(lS, 2/?)-dihydroxy-3-methylcyclohexa-3,5-diene 
(ds-toluene dihydrodiol), respectively (D.T. Gibson, et al, Biochemistry, 1970, 
9, 1631-1635; D.T. Gibson, et al M Biochemistry, 1970, 9, 1626-1630; and V.M. 
Kobal et al., 7. Am. Chem. Soc, 1973, 95, 4420-4421). 

20 The enzyme catalyzing these reactions, toluene dioxygenase 

(TOO), is capable of producing enantiomerically pure cyclohexadiene ds-diols 
from a wide range of aromatic substrates. D.T. Gibson, et al., Microbial 
Degradation of Organic Compounds (Gibson, D. T., ed.), pp. 181-251, Marcel 
Dekker, New York, NY (1984); D.T. Gibson, et al., Pseudomonas: 

25 biotransformations, pathogenesis, and evolving biotechnology, (Silver, S. et al., 
ed.), pp. 121-132, American Society for Microbiology, Washington D. C. 
(1990); G.N. Sheldrake, Chirality in Industry: the Commercial Manufacture and 
Application of Optically Active Compounds (Collins, A. N. et al., eds.), pp. 127- 
166, John Wiley & Sons, Chichester, UK (1992); Stabile, M. R., Ph.D. thesis. 

30 Virginia Polytechnic Institute and State University, Blacksburg, VA (1995); and 
D.R. Boyde and G.N. Sheldrake, Nat Prod. Rep. 1988, 15, 309-324. 
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In contrast to the body of work relating to TDO, relatively little 
attention has been paid to the related enzyme naphthalene dioxygenase. Cells of 
Pseudomonas sp NCD3 9816-4 contain an inducible multi-component enzyme 
system designated NDO, which initiates naphthalene catabolism by catalyzing 

5 the addition of both atoms of molecular oxygen and two hydrogen atoms to the 
substrate to yield enantiomerically pure (+)-ci>(l/?,2S)-dihydroxy-l,2- 
dihydronaphthalene (DM, Jerinaet zLArch. Biochem. Biophys, 1971, 142, 394- 
396). NDO has a relaxed substrate specificity and catalyzes the deoxygenation 
of many related 2- and 3-ring aromatic and hydroaromatic (benzocyclic) 

10 compounds to their respective ds-diols. 

The potential of NDO to form products of opposite chirality to 
those formed by TDO was first noted in 1988 during studies on the oxidation of 
indan. The major product formed by TDO was (-)-( i^-indanol (84% 
enantiomeric excess [ee]) whereas NDO produced (+)-(15)-indanol (>92% ee) 

15 (L.P. Wackett et al., Biochemistry, 1988, 27, 1360-1367. Subsequent studies 
with NDO revealed further differences in substrate specificity and suggested that 
this enzyme is an additional source of chiral intermediates and synthons for the 
enantiospecific synthesis of biologically active products. S.M. Resnick et al. 
Journal of Industrial Microbiology, 1996, 17, 438-457. 

20 NDO belongs to a family of bacterial enzymes that have an 

essential role in the recycling of carbon in nature. These enzymes are especially 
important in the degradation of aromatic hydrocarbons and related environmental 
pollutants. Knowledge of the NDO reaction mechanism is thus important in the 
development of bioremediation strategies for cleaning up environments 

25 contaminated with hazardous aromatic compounds. An attractive alternative to 
bioremediation is the application of 'green chemistry,' which refers to the 
production of industrial chemicals by processes that do not generate hazardous 
waste. For example, a recombinant strain of Escherichia coli expressing NDO, 
has been used to synthesize indigo dye from glucose. w-Arene diols produced 

30 by NDO and toluene dioxygenase have been used in the synthesis of many 
products of biological and economic importance. 
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Knowledge of the types of reactions catalyzed by NDO and the 
range of substrates oxidized by NDO is based largely on biotransformation 
studies with c^naphthalene dihydrodiol dehydrogenase (DDH) mutants, 
recombinant strains expressing NDO and purified NDO components. 
5 Pseudomonas sp 98 16/1 1 is a DDH mutant of strain 9816-4 (G.M. Klecka and 
G.T. Gibson, Biochem 1979, 180, 639-645) which accumulates cis- 
naphthalene- 1 ,2-dihydrodiol when induced cells are incubated with naphthalene 
and a suitable carbon source (D,S. Torok, J. BacterioL 1995, 777, 5799-5805. 
Studies with purified dioxygenase components have been crucial in the 

10 identification of reactions catalyzed by NDO in the absence of other host- 
associated enzyme activities which, through subsequent catalysis, have the 
potential to affect product distribution and/or stereochemistry. 

In addition to cw-dihydroxylation, NDO also catalyzes a variety 
of other oxidations which include monohydroxylation, desaturation 

15 (dehydrogenation), O- and //-dealkylation and sulfoxidation reactions. S.M. 
Resnick et aL Journal of Industrial Microbiology, 1996, 77, 438-457. Many of 
the reactions catalyzed by NDO and other microbial dioxygenases yield 
hydroxylated compounds that can serve as chiral intermediates or chiral synthons 
for a variety of compounds of interest to pharmaceutical and specialty chemical 

20 industries. 

Despite the wide range of useful oxygenated materials that can be 
prepared with TDO and NDO, there is currently a need for additional oxygenated 
chiral synthons that can be used to prepare therapeutically useful compounds, or 
useful intermediates. In particular, there is a need for additional chiral synthons 

25 that differ from the TDO and NDO products by absolute configuration or by the 
site of oxygenation. There is also a need for new methods to prepare 
hydroxylated aryl compounds for use in the polymer, resin, pharmaceutical or 
rubber industry, which generate less industrial waste than currently available 
methods. Further, there is a need for novel enzymes possessing structures, 

30 stabilities, or reactivities that differ from the native enzymes. 
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Summary of the Invention 
The crystal structure of NDO has recently been published by B. 
Kauppi et al. Structure, 1998, 6, no. 5, 571-586. Based on this structure, the 
amino acid at position 352 is located at the active site of NDO. As described 
5 hereinbelow, site-directed mutagenesis was used to construct DNA molecules 
that encode NDO mutants having amino acid substitutions at position 352. 
Changing the amino acid at position 352 from phenylalanine to valine provided 
an enzyme (SEQ ID NO:2, encoded by SEQ ID NO: 1) that gives a change in the 
preferred absolute configuration of the 1,2-dihydroxy-l ,2-dihydronaphthalene 
10 formed from naphthalene. This enzyme also gave a change in the 

regioselectivity of the products obtained from oxidation of biphenyl and 
phenanthrene. 

Accordingly, the invention provides an NDO or NDO related 
complex comprising a plurality of polypeptides, wherein the complex comprises 

15 at least one alpha-subunit polypeptide that comprises: 1) a substituted amino acid 
(e.g. valine or leucine) at the position corresponding to position 352 in NDO, 2) 
a substituted amino acid at the position corresponding to position 201, 202, 260, 
316, 351, 358, 362, or 366 in NDO, or 3) a substituted amino acid at the position 
corresponding to position 352 in NDO, and a substituted amino acid at the 

20 position corresponding to position 201, 202, 260, 316, 351, 358, 362, or 366 in 
NDO; or a catalytically active fragment thereof. The complexs of the invention 
can preferably be isolated and purified. 

25 The invention also provides an isolated and purified polypeptide 

having Swiss-prot data base Accession Number P23094 that comprises an amino 
acid other than phenylalanine at position 352, or a catalytically active variant, or 
a catalytically active fragment thereof. Preferably, the amino acid at position 352 
is a naturally occurring amino acid. More preferably, the polypeptide is SEQ ID 

30 NO:2, 32, 33, 34, 35, or 36. 

The invention also provides an isolated and purified NDO related 
polypeptide wherein the amino acid at the position corresponding to position 352 
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in NDO has been replaced with another amino acid, or a catalytically active 
fragment or catalytically active variant thereof. Preferably, the amino acid at the 
position corresponding to position 352 in NDO is a naturally occurring amino 
acid. More preferably, the amino acid at the position corresponding to position 

5 352 in NDO is valine. 

Site-directed mutagenesis was also used to construct DNA 
molecules that encode NDO mutants having an amino acid substitution at 
position 201, 202, 260, 316, 351, 352, 358, 362, or 366. Accordingly, the 
invention provides an isolated and purified NDO wherein the amino acid at 

10 position 201, 202, 260, 3 16, 351, 352, 358, 362, or 366 has been replaced with 
another amino acid, or a catalytically active variant, or a catalytically active 
fragment thereof. 

Changing the amino acid at position 352 in NDO from 
phenylalanine to valine provided an enzyme (SEQ ID NO:2, encoded by SEQ ID 

1 5 NO: 1 ) that gives a change in the preferred absolute configuration of the 1 ,2- 
dihydroxy-l,2~dihydronaphthalene formed from naphthalene. This enzyme also 
gave a change in the regioselectivity of the products obtained from oxidation of 
biphenyl and phenanthrene 

The invention also provides an isolated and purified DNA 

20 segment encoding a polypeptide of the invention, or a variant or fragment 
thereof. 

The invention also provides a primer or probe having about 80% 
nucleic acid sequence identity with a DNA segment encoding a polypeptide of 
the invention, or a variant or fragment thereof 
25 The invention also provides an expression cassette comprising a 

promotor operably linked to a DNA segment encoding a polypeptide of the 
invention or a variant or fragment thereof 

The invention also provides a host cell, the genome of which is 
augmented by a DNA segment encoding a polypeptide of the invention, or a 
30 catalytically active variant or fragment thereof 

The invention also provides a method to produce a catalytically 
active polypeptide comprising culturing a host cell transformed with a DNA 
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segment encoding a polypeptide of the invention, or a catalytically active variant 
or fragment thereof, so that the host cell expresses the polypeptide, variant or 
fragment. 

The invention also provides a method for preparing cis- 
5 naphthalene dihydrodiol (e.g. (-)-( 15,2/?)-c^-naphthaIene dihydrodiol) 
comprising contacting naphthalene with a polypeptide of the invention, or a 
catalytically active variant or fragment thereof. 

The invention also provides a method for preparing cis- 
naphthalene dihydrodiol (e.g. (-)-(15,2i?)-ci5-naphthalene dihydrodiol) 
1 0 comprising contacting a host cell of the invention with naphthalene. 

The invention also provides a method for preparing cz>biphenyl- 
3,4-dihydrodiol (e.g. (-) or (+) cw-biphenyl-3,4-dihydrodiol) comprising 
contacting biphenyl with a polypeptide of the invention, or a catalytically active 
variant or fragment thereof. The method may optionally further comprise 
1 5 dehydrating the m-biphenyl-3,4-dihydrodiol to give 4-hydroxybiphenyl. 

The invention also provides a method for preparing m-biphenyl- 
3,4-dihydrodiol (e.g, (0-cw-biphenyl-3,4-dihydrodiol) comprising contacting a 
host cell of the invention with biphenyl. The method may optionally further 
comprise dehydrating the (-)-cis-biphenyl-3,4-dihydrodiol to give 4- 
20 hydroxybiphenyL 

The invention also provides a method for preparing cis- 
phenanthrene- 1 ,2-dihydrodiol (e.g. cw-( 1 S,2R)-phenanthrene- 1 ,2-dihydrodiol) 
comprising contacting phenanthrene with a polypeptide of the invention, or a 
catalytically active variant or fragment thereof. 
25 The invention also provides a method for preparing cis- 

phenanthrene- 1 ,2-dihydrodiol (e.g. cis-( 1 S,2R)-phenanthrene- 1 ,2-dihydrodiol) 
comprising contacting a host cell of the invention with phenanthrene. 

The invention also provides a method to oxidize an aromatic 
compound to a corresponding dihydrodihydroxy compound comprising 
30 contacting the aromatic compound with a polypeptide of the invention, or a 
catalytically active variant or fragment thereof* 
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The invention also provides a method to oxidize an aromatic 
compound to a corresponding dihydrodihydroxy compound comprising 
contacting the aromatic compound with a host cell of the invention. 

The invention also provides a method to prepare an optically 
5 active ds-cyclohexadiene of formula (I): 



(i) 




wherein one of R, and R 2 is hydrogen and the other is phenyl, 2-phenylvinyl, 2- 
phenylethynyl, or vinyl, wherein any phenyl ring may optionally be substituted 
with 1, 2, or 3 substituents independently selected from the group consisting of 
10 hydroxy, halo, carboxy, cyano, nitro, trifluoromethyl, amino, (C r C 6 )aIkyl, (C r 
C 6 )alkoxycarbonyl, and (C r C 6 )aIkoxy, comprising contacting a corresponding 
compound of formula (II): 




with a polypeptide of the invention, or a catalytically active variant or fragment 
thereof, or with a host cell of the invention. Preferably, R { is phenyl or 4- 

15 hydroxyphenyl. The method may optionally comprise dehydrating the resulting 
compound of formula I. 

The invention also provides a method to prepare ci s- 1,2- 
dihydroxyindan comprising contacting indene with a polypeptide of the 
invention, or a catalytically active variant or fragment thereof, or with a host cell 

20 of the invention. 

The invention also provides a method to prepare 1,2-dihydroxy- 
1,2,3,4-tetrahydronaphthalene comprising contacting l,2-dihydronaphtha!ene 
with a polypeptide of the invention, or a catalytically active variant or fragment 
thereof, or with a host cell of the invention. 
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The invention also provides a method to prepare 1,2-dihydroxy- 
1 ,2-dihydrophenanthrene or 3,4-dihydroxy-3,4-dihydrophenanthrene comprising 
contacting phenanthrene with a polypeptide of the invention, or a catalytically 
active variant or fragment thereof, or with a host cell of the invention. 
5 The invention also provides novel compounds and intermediates 

disclosed herein, as well as crystallized forms of the polypeptides disclosed 
herein. Preferably, the invention provides a crystallized form of SEQ ED NO:26. 

Polypeptides of. the invention provide a biosynthetic route to the 
either enantiomer of a>naphthalene dihydrodiol, to either enantiomer of cis- 

1 0 biphenyl-3,4-dihydrodiol, and to either enantiomer of ds-phenanthrene- 1 ,2- 
dihydrodiol. These products can be used in the synthesis of new polymers and 
pharmaceutical products. For example, arene as-diols are useful starting 
materials for stereospecific organic synthesis (S.M. Brown, et al., Organic 
Synthesis: Theory and Applications (Hudlicky, T., ed.), pp, 1 13-176, JAI Press, 

1 5 Greenwich, CT ( 1 993); and T. Hudlicky and J.W. Reed, Adv. Asymm. Synth. 
1995, A 271-312). They can undergo a variety of reactions including 
asymmetric Diels-Alder reactions, epoxidation, photochemical oxygenation, 
metallation, diol cleavage, diene cleavage, carbene additions and ozonolysis. 
They have been used to prepare a variety of synthetic products that are not 

20 readily obtainable by conventional chemical synthesis. Examples include 
conduritols, inositol phosphates, pinitol enantiomers, prostanoid and terpene 
synthons, and complex natural products such as (-)-zeylena and (+)-lycoricidine. 



Detailed Description 

25 The following definitions are used, unless otherwise described: 

halo is fluoro, chloro, bromo, or iodo. Alkyl, alkoxy, denote both straight and 
branched groups; but reference to an individual radical such as "propyl" 
embraces only the straight chain radical, a branched chain isomer such as 
"isopropyl" being specifically referred to. 

30 The term "amino acid," comprises the residues of the natural 

occurring amino acids (e.g, Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, Hyl, 
Hyp, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Vai) in D or L form, as 
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well as unnatural amino acids (e.g. phosphoserine, phosphothreonine, 
phosphotyrosine, hydroxyproline, garnma-carboxyglutamate; hippuric acid, 
octahydroindole-2-carboxyIic acid, statine, 1,2,3,4,-tetrahydroisoquinoline- 
3-carboxyIic acid, penicillamine, ornithine, citruline, a-methyl-alanine, 
5 para-benzoylphenylaianine, phenylglycine, propargylglycine, sarcosine, and 
tert-butyl glycine). 

As used herein, the terms "isolated and/or purified" refer to in 
vitro preparation, isolation and/or purification of a nucleic acid molecule, 
sequence or segment of the invention, so that it is not associated with in vivo 

10 substances. Thus, with respect to an "isolated nucleic acid molecule sequence or 
segment", which includes a polynucleotide of DNA or RNA or of synthetic 
origin or some combination thereof, the "isolated nucleic acid molecule sequence 
or segment" (1) is not associated with all or a portion of a polynucleotide in 
which the "isolated nucleic acid molecule" is found in nature, (2) can be 

1 5 operably linked to a polynucleotide which it is not linked to in nature, or (3) does 
not occur in nature as part of a larger sequence. An isolated nucleic acid 
molecule means a polymeric form of nucleotides of at least 10 bases in length, 
either ribonucleotides or deoxynucleotides or a modified form of either type of 
nucleotide. The term includes single and double stranded forms of DNA. The 

20 term "oligonucleotide" referred to herein includes naturally occurring, and 
modified nucleotides linked together by naturally occurring, and non-naturally 
occurring oligonucleotide linkages. Oligonucleotides are a polynucleotide subset 
with 200 bases or fewer in length. Preferably, oligonucleotides are 10 to 60 
bases in length and most preferably 12, 13, 14, 15, 16, 17, 18, 19, or 20 to 40 

25 bases in length. Oligonucleotides are usually single stranded, e.g., for probes or 
primers; although oligonucleotides may be double stranded, e.g., for use in the 
construction of a variant. Oligonucleotides of the invention can be either sense 
or antisense oligonucleotides. The term "naturally occurring nucleotides" 
referred to herein includes deoxyribonucleotides and ribonucleotides. The term 

30 "modified nucleotides" referred to herein includes nucleotides with modified or 
substituted sugar groups and the like. The term "oligonucleotide linkages" 
referred to herein includes oligonucleotides linkages such as phosphorothioate, 
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phosphorodithioate, phophoroselenoate, phosphorodiselenoate, 
phosphoroanilothioate, phosphoraniladate, phosphoroamidate, and the like. An 
oligonucleotidecan include a label for detection, if desired. 

The term "isolated polypeptide" means a polypeptide encoded by 
5 DNA or RNA, including polypeptides that are synthetic in origin, or some 
combination thereof, which isolated polypeptide (i) is not associated with 
proteins found in nature, (2) is free of other proteins from the same source, (3) is 
expressed by a cell from a different species, or (4) does not occur in nature. 

The term "selectively hybridize" means to detectably and 

10 specifically bind. Polynucleotides, oligonucleotides and fragments of the 

invention selectively hybridize to nucleic acid strands under hybridization and 
wash conditions that minimize appreciable amounts of detectable binding to 
nonspecific nucleic acids. High stringency conditions can be used to achieve 
selective hybridization conditions as known in the art and discussed herein, 

15 Generally, the nucleic acid sequence identity between the polynucleotides, 
oligonucleotides, variants, and fragments of the invention and a nucleic acid 
sequence of interest is at least about 65%, and more typically with preferably 
increasing identities of at least about 70%, about 90%, about 95%, about 98%, 
and 100%. See Sambrook et aL Molecular Cloning: A Laboratory Manual, Cold 

20 Spring Harbor (1989). 

The term "corresponds to" is used herein to mean that a 
polynucleotide sequence is homologous (i.e., is identical, not strictly 
evolutionarily related) to all or a portion of a reference polynucleotide sequence, 
or that a polypeptide sequence is identical to a reference polypeptide sequence. 

25 In contradistinction, the term "complementary to" is used herein to mean that the 
complementary sequence is homologous to all or a portion of a reference 
polynucleotide sequence. For illustration, the nucleotide sequence "TATAC" 
corresponds to a reference sequence "TATAC" and is complementary to a 
reference sequence "GTATA". 

30 The term "NDO related polypeptide (or complex)" means a 

polypeptide (or complex) that belongs to the same family of bacterial enzymes as 
NDO or TOO, Preferably, an NDO related polypeptide has substantial identity 
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with NDO. More preferably, an NDO related polypeptide is encoded by the 
DNA sequence having Genbank Accession number M60405, M23914, 
ARM 0471, AFG04284, M83949, AF004283, AB004059, D84146, AF036940, 
U49504, or U62430. 

5 The term "catalytically active," when applied to a polypeptide or a 

polypeptide variant or fragment of the invention, means that the variant or 
fragment catalyzes one or more of the reactions catalyzed by NDO or an NDO 
related polypeptide (see for example S.M Resnick, et al., Journal of Industrial 
Microbiology, 1996, 17, 438-457). Preferably, a catalytically active polypeptide, 

10 or a catalytically active variant, or fragment catalyzes one or more of the 

reactions catalyzed by NDO. More preferably, a catalytically active polypeptide, 
or a catalytically active variant, or fragment catalyzes the oxidation of an 
aromatic substrate to give the corresponding dihydrodihydroxy compound (e.g. 
the oxidation of biphenyl or phenanthrene to 3,4-dihydroxy-3,4-dihydrobiphenyl 

15 or 1 ,2-dihydroxy- 1 ,2-dihydrophenanthrene, respectively). 

The term "biologically active" with respect to a fragment or 
variant of a polypeptide means that the fragment or variant has at least about 
10%, preferably at least about 50%, and most preferably at least about 90%, the 
activity of the reference polypeptide. The activity of a polypeptide of the 

20 invention can be measured by methods well known to the art including, but not 
limited to, the ability of the peptide to elicit a sequence-specific immunologic 
response when the peptide is administered to an organism, e.g., chicken, goat, 
sheep or mice. The invention also provides polypeptides, as well as fragments 
and variants thereof, that are useful to elicit a sequence-specific immunologic 

25 response when the peptide is administered to an organism (e.g. a mammal). 

When applied to a polypeptide the term "fragment" means a 
portion of the reference polypeptide that comprises the amino acid that 
corresponds to amino acid 352 in SEQ ID NO:26. Preferably the portion of the 
reference polypeptide also comprises at least about 10, 50, or 500 additional 

30 amino acids of the reference peptide. More preferably, the portion of the 
reference polypeptide comprises at least about 1000 or 1500 additional amino 
acids or the reference peptide. 
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When applied to a nucleotide sequence the term "fragment" 
means a portion of the reference nucleotide sequence that 1) encodes the amino 
acid corresponding to amino acid 352 in SEQ ID NO:26, and 2) encodes a 
catalytically active polypeptide. Preferably the portion of the reference 
5 nucleotide sequence also comprises at least about 30, 60, 150, or 300, nucleoside 
bases of the reference nucleotide sequence. More preferably, the portion of the 
reference nucleotide sequence also comprises at least about 600, 900, or 1200 
nucleoside bases of the reference nucleotide sequence. 

The following terms are used to describe the sequence 

10 relationships between two or more polynucleotides: "reference sequence", 
"comparison window", "sequence identity", "percentage of sequence identity' 1 , 
and "substantial identity". 

A "reference sequence" is a defined sequence used as a basis for a 
sequence comparison; a reference sequence may be a subset of a larger sequence, 

1 5 for example, as a segment of a full-length DNA or gene sequence given in a 
sequence listing, or may comprise a complete DNA or gene sequence. 
Generally, a reference sequence is at least 20 nucleotides in length, frequently at 
least 25 nucleotides in length, and often at least 50 nucleotides in length. Since 
two polynucleotides may each (1) comprise a sequence (i.e., a portion of the 

20 complete polynucleotide sequence) that is similar between the two 

polynucleotides, and (2) may further comprise a sequence that is divergent 
between the two polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window" to identify and compare local 

25 regions of sequence similarity. 

A "comparison window", as used herein, refers to a conceptual 
segment of at least 20 contiguous nucleotides and wherein the portion of the 
polynucleotide sequence in the comparison window may comprise additions or 
deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence 

30 (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. Optimal alignment of sequences for aligning a comparison 
window may be conducted by the local homology algorithm of Smith and 
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Waterman (1981) Adv. Appl. Math. 2: 482, by the homology alignment algorithm 
of Needleman and Wunsch (1970) J. Mol Biol 48; 443, by the search for 
similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Set (U.S.A.) 
85: 2444, by computerized implementations of these algorithms (GAP, 
5 BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package 
Release 7.0, Genetics Computer Group, 575 Science Dr, t Madison, Wis.), or by 
inspection, and the best alignment (i.e., resulting in the highest percentage of 
homology over the comparison window) generated by the various methods is 
selected. 

10 The term "sequence identity" means that two polynucleotide 

sequences are identical (i.e., on a nucleotide-by-nucleotide basis) over the 
window of comparison. "Percentage of sequence identity" is calculated by 
comparing two optimally aligned sequences over the window of comparison, 
determining the number of positions at which the identical nucleic acid base 

15 (e.g., A, T, C, G, U, or I) occurs in both sequences to yield the number of 

matched positions, dividing the number of matched positions by the total number 
of positions in the window of comparison (i.e., the window size), and 
multiplying the result by 100 to yield the percentage of sequence identity. The 
terms "substantial identity" as used herein denote a characteristic of a 

20 polynucleotide sequence, wherein the polynucleotide comprises a sequence that 
has at least about 85 percent sequence identity, preferably at least about 90 to 
about 95 percent sequence identity, more usually at least about 99 percent 
sequence identity as compared to a reference sequence over a comparison 
window of at least 20 nucleotide positions, frequently over a window of at least 

25 20-50 or 50-200 nucleotides, wherein the percentage of sequence identity is 
calculated by comparing the reference sequence to the polynucleotide sequence 
which may include deletions or additions which total 20 percent or less of the 
reference sequence over the window of comparison. 

As applied to polypeptides, the term "substantial identity" means 

30 that two peptide sequences, when optimally aligned, such as by the programs 
GAP or BESTFIT using default gap weights, share at least about 80 percent 
sequence identity, preferably at least about 80 percent sequence identity, more 
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preferably at least about 90 percent sequence identity, and most preferably at 
least about 99 percent sequence identity. Alternatively two polypeptide 
sequences have- substantial identity if they have an alignment score of at more 
than 5 (in standard deviation units) using the program ALIGN with the mutation 
5 data matrix and a gap penalty of 6 or greater. See Dayhoff, M. O., in Atlas of 
Protein Sequence and Structure, 1972, volume 5, National Biomedical Research 
Foundation, pp. 101-1 10, and Supplement 2 to this volume, pp. 1-10. 
Preferably, the two sequences have substantial identity if their amino acids are 
greater than or equal to 50% identical when optimally aligned using the ALIGN 

10 program. When a percent sequence identity is given, it means that the stated 
percentage of the amino acids are identical when the two sequences are aligned 
for maximum matching. Gaps (in either of the two sequences being matched) 
are allowed in maximizing matching; gap lengths of 5 or less are preferred with 
2 or less being more preferred. 

15 As used herein, "substantially pure" means an object species is 

the predominant species present (i.e., on a molar basis it is more abundant than 
any other individual species in the composition), and preferably a substantially 
purified fraction is a composition wherein the object species comprises at least 
about 50 percent (on a molar basis) of all macromolecular species present. 

20 Generally, a substantially pure composition will comprise more than about 80 
percent of all macromolecular species present in the composition, more 
preferably more than about 85%, about 90%, about 95%, and about 99%. Most 
preferably, the object species is purified to essential homogeneity (contaminant 
species cannot be detected in the composition by conventional detection 

25 methods) wherein the composition consists essentially of a single 
macromolecular species. 

When applied to a polypeptide the term "variant," means a 
polypeptide that 1) has substantial identity with but is not identical to the 
reference polypeptide; and 2) is identical to the reference polypeptide at the 

30 position corresponding to amino acid 352 in SEQ ID NO:26. 

When applied to a nucleotide sequence, the term "variant" means 
a nucleotide sequence that 1) has substantial identity with but is not identical to 
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the reference sequence; and 2) encodes the same amino acid as the reference 
sequence at the position encoded by the reference sequence that corresponds to 
amino acid 352. in SEQ ID NO:26. 

The variant DNA molecules of the invention may include DNA 
5 molecules with "silent" substitutions. For example, a preferred embodiment of 
the invention is an isolated and purified DNA molecule comprising a preselected 
DNA segment encoding SEQ ID NO:2> wherein the DNA segment comprises 
SEQ ID NO; 1, or variants of SEQ ID NO; 1, having nucleotide substitutions 
which are "silent" (see Table 1). That is, when silent nucleotide substitutions are 

10 present in a codon, the same amino acid is encoded by the codon with the 
nucleotide substitution as is encoded by the codon without the substitution. 
"Silent" nucleotide substitutions in SEQ ID NO: 1 which can encode a peptide 
having SEQ ID NO;2 can be ascertained by reference to Table 1 and page Dl in 
Appendix D in Sambrook et aL, Molecular Cloning: A Laboratory Manual 

1 5 (1989). Nucleotide substitutions can be introduced into DNA segments by 
methods well known to the art. See, for example, Sambrook et aL, supra. 

Table 1 





Amino Acid 


Codon 


20 


Phe 


UUU, UUC 




Ser 


UCU, UCC, UCA, UCG, AGU, AGC 




Tyr 


UAU, UAC 




Cys 


UGU.UGC 




Leu 


UUA, UUG, CUU, CUC, CUA, CUG 


25 


Trp 


UGG 




Pro 


CCU, CCC, CCA, CCG 




His 


CAU, CAC 




Arg 


CGU, CGC, CGA, CGG, AGA, AGG 




Gin 


CAA, CAG 


30 


ne 


AUU, AUC, AUA 




Thr 


ACU, ACC, ACA, ACG 




Asn 


AAU, AAC 




Lys 


AAA, AAG 
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Amino Acid Codon 

Met AUG 

Yal GUU, GUC, GU A, GUG 

Ala GCU, GCC, GCA, GCG 

Asp GAU, GAC 

Gly GGU, GGC, GGA, GGG 

Glu GAA, GAG 



Specific values listed below for radicals, substituents, and ranges, 
are for illustration only; they do not exclude other defined values or other values 

10 within defined ranges for the radicals and substituents. 

Specifically, (C r C 6 )alkyl can be methyl, ethyl, propyl, isopropyl, 
butyl, iso-butyl, sec-butyl, pentyl, 3-pentyI, or hexyl; and (C r Q)alkoxy can be 
methoxy, ethoxy, propoxy, isopropoxy, butoxy, iso-butoxy, sec-butoxy, pentoxy, 
3-pentoxy, or hexyloxy. 

1 5 Pseudomonas sp NCIB 98 1 6-4 contain an inducible 

multicomponent enzyme system designated naphthalene dioxygenase (NDO) 
which catalyzes the formation of m-naphthalene dihydrodiol. The system 
consists of an iron-sulfur flavoprotein (reductase^), a Rieske [2Fe-2S] protein 
(ferredoxin NAP ), and an iron-sulfur protein (ISP NAP ), which serves as the terminal 

20 oxygenase component, ISP^p has an a 3 p 3 subunit component and each u 
subunit contains a Rieske [2Fe-2S] cluster and mononuclear iron. The Rieske 
cluster is believed to be an electron storage center that transfers electrons to 
mononuclear iron which is responsible for dioxygen activation and ultimately the 
catalytic reaction. 

25 The genes encoding the NDO complex in Pseudomonas sp NCIB 

9816-4 have been cloned and expressed in Escherichia coli (W-C Suen and D.T. 
Gibson, Gene, 1994, 143, 67-71 ; and W-C Suen Ph.D. Thesis The University of 
Iowa, Iowa City, Iowa, 1993). The nucleotide sequences of the genes encoding 
reductase NAP (nahAa), ferredoxin NAP {nahAb\ and ISP NAP (nahAcAd) have been 

30 determined and show 93.3%, 93.3%, 96.9%, and 94.8% identity, respectively, at 
the predicted amino acid level with the isofunctional genes carried by the well- 
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studied NAH7 plasmid in P. putida G7 (S. Kurkele, Gene, 1988, 73, 355-362; 
MJ. Simon, Gene, 1993, 127, 31-37). 

A number of dioxygenases with a structure similar to NDO have 
been identified. For example, the dioxygenases shown in Table 2 have a 
5 subunits with greater than 80% amino acid identity to naphthalene dioxygenase 
from Pseudomonas sp. strain NCIB 9816-4 (calculated using GAP Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, Wisconsin). Because of their similarity to NDO in structure and 
function, it is reasonable to believe that modifying these polypeptides by 

10 replacing the amino acid corresponding to F352 in NDO with valine, will 

provide novel polypeptides that have similar advantageous properties compared 
to the native polypeptides. Accordingly, the invention also provides the 
following DNA molecules (SEQ ID No.'s 3-13) that have been modified to 
encode valine at the position corresponding to the F352 amino acid in NDO, as 

15 shown in Table 2. 



Table 2 



Genbank 


Amino Acid 


SEQ ID NO. For 


SEQ ID NO. For 


Accession 


Corresponding to 


Modified DNA 


Corresponding 


Number 


F352 in NDO 


Sequence 


Polypeptide 


M60405 


F352 


SEQIDNO:3 


SEQIDNO:14 


M23914 


F352 


SEQE>NO:4 


SEQ ID NO: 15 


AFO 10471 


F352 


SEQ ID NO:5 


SEQE>NO:16 


AF004284 


F352 


SEQIDNO:6 


SEQ ID NO: 17 


M83949 


F352 


SEQIDNO:7 


SEQ ID NO: 18 


AF004283 


F352 


SEQE>NO:8 


SEQ ID NO: 19 


AB004059 


F352 


SEQIDNO:9 


SEQ ID NO:20 


D84146 


F352 


SEQ ID NO: 10 


SEQIDNO:21 


AF036940 


F350 


SEQ ID NO: 11 


SEQIDNO:22 


U49504 


1350 


SEQ ID NO: 12 


SEQIDNO:23 


U62430 


T355 


SEQ ID NO: 13 


SEQIDNO:24 
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The invention also provides the polypeptides (SEQ ID No.'s 14-24) that are 
encoded by the DNA molecules of SEQ ID No.'s 3-13. The invention also 
provides a hostcell, the genome of which is augmented by a DNA molecule 
having SEQ ID NO:3-I3. 
5 In addition to the enantiospecific cw-dihydroxylation of 

naphthalene, NDO catalyzes dioxygenation of a variety of multicyclic and 
heterocyclic aromatic compounds to produce, in many cases, chiral cis- 
dihydrodiols. The mutant and recombinant strains expressing polypeptides of 
the invention allow the synthesis of ds-diols in high yields and/or high 

1 0 enantiomeric purity. For example, substrates which are oxidized to cis- 

dihydrodiols by NDO include indene, 1,2-dihydronaphthalene, benzocyclohept- 
1-ene, anthracene, phenanthrene, dibenzo[l ,4]dioxan, acenaphthylene, 1- and 2- 
substituted naphthalenes, biphenyl, fluorene, dibenzofuran, dibenzothiophene, 
9, 10-dihydroanthracene, and 9,10-dihydrophenanthrene. 

1 5 The DNA segment having Genbank Accession Number U49496 

is SEQ ID NO:25, The polypeptide having Swiss-prot data base Accession 
Number P23094 is SEQ ID NO:26. 

The invention will now be illustrated by the following non- 
limiting Examples. 

20 

Examples 

Example 1. Construction of site-directed mutations. 

Escherichia coli strains DH5a (Life Technologies, Gaithersburg, 
MD) and JM109(DE3)) (Promega Corp., Madison, WI) were used for subcloning 

25 and gene expression experiments, respectively. Competent E. coli strains 
ES 1301 and JM109 were purchased from Promega Corp. and used in the site- 
directed mutagenesis procedure described below. 

E. coli strains were grown at 30 or 37 °C in Luria-Bertani (LB) 
medium (R.W. Davis, et al. Advanced Bacterial Genetics, Cold Spring Harbor 

30 Laboratory, Cold Spring Harbor, N.Y., 1980), or Terrific Broth (TB) medium 
(Lee, S.-Y., and S, Rasheed, BioTechniques, 1990, 9, 676-679). To maintain 
plasmids, ampicillin or tetracycline was added to final concentrations of 150 and 
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12 pg/ml, respectively. JM109(DE3) strains carrying plasmids of interest were 
maintained on minimal medium plates (MSB) (R. Y. Stanier, et al., J. Gen. 
Microbiol, 1966, 43, 159-271) containing 10 mM glucose, 0.1 mM thiamine, 
and ampicillin. For plates, MSB was solidified with 3.8% Agar Noble (Difco 
5 Laboratories, Detroit, Mich.) and LB was solidified with 1 .5% Bactoagar (Difco 
Laboratories). 

Mutagenesis of nahAc (encoding the naphthalene dioxygenase a 
subunit) was carried out with the Altered Sites II in vitro Mutagenesis System 
according to the manufacturer's instructions (Promega Corp.). A 1 .5-kb Kpnl- 

10 Xbal fragment carrying the 3' half of the nahAc gene and the complete nahAd 
gene from pDTG141 (Suen, W.-C, Gene expression of naphthalene dioxygenase 
from Pseudomonas sp, NCIB 9816-4 in Escherichia colL Ph. D. thesis. The 
University of Iowa, Iowa City, Iowa, 1991) was cloned into £p«I-XM-digested 
pALTER- 1 (Promega Corp.). The resulting plasmid, designated pMASTER-1 , 

1 5 was used as the template for mutagenesis. 

The mutagenic oligonucleotide (5-TTCAGC GAACGGT CGG- 
GCCTGC-3') (SEQ ID NO:37) was designed such that the restriction pattern of 
the plasmid was altered (eliminating a Ry/?1406I site shown by underlined bases; 
T-G base change shown in bold) to facilitate screening for clones carrying the 

20 desired mutation. The same T-G base change alters the TTC codon specifying 
phenylalanine to GTC, which specifies valine. 

The phosphorylated oligonucleotide used for mutagenesis was 
synthesized by Genosys Biotechnologies Inc., Midland, Tex. pM ASTER- 1 
(2 jag) was denatured at room temperature for 5 min in the presence of 200 mM 

25 NaOH, 0.2 mM ethylenediamine tetraacetic acid (EDTA) in a 20 ^1 volume. 
The denatured plasmid DNA was precipitated by addition of 2 M ammonium 
acetate (2 pH 4.6) and 100% ethanol (75 pi) and incubated at -70 °C for 30 
minutes. After centrifugation for 15 minutes at 14,000 rpm in an Eppendorf 
centrifuge, the DNA pellet was dried under vacuum and dissolved in 10 mM 

30 Tris, 1 mM EDTA (100 ml; pH 8.0). 

The primer annealing reaction was carried out with the following 
components in a final volume of 20 pi: denatured pM ASTER- 1 (10 jil); 
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tetracycline repair oligonucleotide ( 1 pi; 5'-GCCGGGCCTCTTGCGGG- 
ATATCGTCCA-3') (SEQ ID NO:38); ampicillin knockout oligonucleotide (1 
jil; 5-GTTGCCATTGCTGCAG-GCATCGTGGTG-3') (SEQ ID NO:39); 
phosphorylated mutagenic oligonucleotide L25 pmol); 10X annealing buffer (2 

5 Ml; 200 mM Tris-HCl, pH 7.5, 100 mM MgCI 2 , 500 mM NaCl). 

The mixture was heated at 75 °C for 5 minutes and cooled slowly 
(1 °C per min) to 45 °C, then more rapidly to room temperature. The synthesis 
reaction mix contained the products of the annealing reaction and the following: 
sterile distilled water (5 pi); 10X synthesis buffer (3 pi, 100 mM Tris-HCl (pH 

10 7,5), 5 mM deoxyribonucleotides, 10 mM adenosine triphosphate, 20 mM 
dithiothreitol); T4 DNA polymerase (1 pi); T4 DNA ligase (1 pi). 

The mixture was incubated at 37 °C for 90 minutes, A portion of 
this mixture (1 .5 pi) was used to transform 100 pi of ES \3QlmutS competent 
cells. The transformation mix was incubated on ice for 10 minutes, heated at 

15 42°C for 45 seconds and then incubated on ice for 2 minutes. LB (900 pi) was 
added and the culture was incubated at 37 °C for 30 minutes without shaking. 
After incubation, 500 pi of this culture was diluted with 4.5 ml of LB containing 
1.25 pg/pl ampicillin and incubated over night at 37°C with shaking. Plasmid 
DNA was purified from the overnight culture as described by Lee, S*-Y., and S. 

20 Rasheed, BioTechniques, 1990, 9, 676-679, and was used to transform JM109 
using standard procedures similar to those described by RM. Ausubel et al, 
Current Protocols in Molecular Biology, John Wiley & Sons, Inc., New York, . 
1993. 

The transformation mixture was plated on LB plates containing 
25 ampicillin. Colonies were screened for tetracycline sensitivity on LB plates 
containing 12 pg/ml tetracycline. Plasmid DNA purified from ampicillin 
resistant, tetracycline sensitive strains was digested with Ac/I (New England 
Biolabs, Beverly, MA) and the restriction pattern was compared to that of 
unmutagenized pMASTER-1 using standard molecular biology methods (for 
30 example see F.M. Ausubel et al., Current Protocols in Molecular Biology, John 
Wiley & Sons, Inc., New York, 1993). The nucleotide sequences of both strands 
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of the entire insertion in pM ASTER- 1 were determined for one mutant plasmid 
that contained the restriction site change. 

Fluorescent automated DNA sequencing was carried out by the 
University of Iowa DNA Facility using an Applied Biosystems 373A automated 
5 DNA sequencer. After verification of each mutation by restriction digestion and 
sequence analysis, the 1 .5-kb KpnhXbal fragments carrying each mutation were 
individually cloned into X/?nI-X£aI-digested pDTG141 (which contains the 
nahAaAbAcAd genes for the naphthalene dioxygenase reductase, ferredoxin and 
a and p subunits of the oxygenase, respectively. The resulting plasmids were 
10 introduced into JM109(DE3) for expression studies. After this subcloning step, 
the presence of the mutation was verified by restriction analysis and one 
sequencing run with a primer that generated sequence in the region of the 
mutagenized base and continued past the Kpnl junction. 

1 5 Example 2. Biotransformations. 

To produce induced cells for biotransformation studies, 
JM109(DE3) carrying the plasmid (pDTG141-F352V) [F=phenylalanine; 
V=valine] was grown in flasks at 30°C in minimal medium (MSB) (R.Y. 
Stanier, et aL, 7. Gen. Microbiol, 1966, 43, 159-271) containing 10 mM glucose, 

20 0. 1 mM thiamine, and 150 ng/ml ampicillin with shaking (200 ipm). 

JM109(DE3)(pDTG141) was grown under identical conditions to provide the 
wjld-type control. Isopropyl-p-D-thiogalactopyranoside (EPTG) was added to a 
final concentration of 100 jjM when culture turbidity reached 0.6-0.8 at 660 nm. 
After a 2.5 hour induction period, biotransformation reactions were initiated. 

25 Cultures were spiked with 20 mM glucose, 100 mM phosphate buffer (pH 7.2), 
and 0.025% (w/v) substrate (e.g. naphthalene, biphenyl, phenanthrene). Cultures 
were incubated at 30°C with shaking (250 rpm) for up to 18 hours. Samples 
were taken periodically and cells were removed by centrifugation. 

Culture supernatants were extracted with sodium hydroxide- 

30 washed ethyl acetate and analyzed by thin layer chromatography (S.M. Resnick, 
et al, FEMS Microbiol Lett, 1993, 113, 297-302). All extracts were analyzed 
by gas chromatography-mass spectrometry (GC-MS) as previously described 
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(S.M. Resnick, S. AppL Environ. Microbiol, 1994, 60, 3323-3328). In some 
cases, dihydrodiol products were derivatized with approximately equimolar 
amounts of phenyl boronic acid at room temperature prior to injection onto the 
GC column, 

5 Generally, biotransformation products were most easily detected 

by thin-layer chromatography (TLC), high-performance liquid chromatography 
(HPLC), and/or gas chromatography-mass spectrometry (GC/MS). Preparative 
TLC, HPLC, column or radial -dispersion chromatography are commonly used 
for the isolation of oxidation products prior to nuclear magnetic resonance 
10 (NMR) structural determination and/or stereochemical analysis. 

Results from the biotransformation studies are shown in Table 3. 



Table 3 







Enantiomeric 


Dihydrodiols From 


Dihydrodiols From 


15 




Composition ofcis- 


Biphenyl 


Phenanthrene 






naphthalene 1,2- 










dihydrodiol product 








NDO Polypeptide 




%2,3 %3,4 


%3,4 %\,2 


20 


Wild Type 


>99 


87.2 12.8 


91.3 8 J 


352-valine 


92.4 


4.4 95.6 


17.0 83.0 



In the biotransformations described above, incubation of biphenyl 
with host cells carrying the plasmid (pDTG141-F352V) yielded predominantly 
3,4-dihydroxy-3,4-dihydrobiphenyl. This is in contrast to results obtained with 

25 the wild type cells, which yielded predominantly the 2,3-dihydroxy-2,3- 
dihydrobiphenyl. Additionally, incubation of phenanthrene with host cells 
carrying the plasmid (pDTG141-F352V) yielded predominantly 1,2-dihydroxy- 
1,2-dihydrophenanthrene, rather than the corresponding 3,4 isomer preferentially 
formed from the wild type cells. 

30 Thus, the polypeptides of the invention and the host cells of the 

invention are useful for preparing chiral diols for use in the polymer, resin, 
pharmaceutical or rubber industry. In particular, the polypeptides of the 
invention and the host cells of the invention are useful for preparing (-)-cis-3,4- 
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dihydroxy-3,4-dihydrobi phenyl, and a single enantiomer (1S,2R) of cis- 1,2- 
dihydroxy- 1 ,2-dihydrophenanthrene. 

The compound (-)-cw-3,4-dihydroxy-3,4-dihydrobiphenyl can be 
dehydrated to provide 4-hydroxybiphenyl, which is useful for the manufacture of 
5 rubber and resins (see The Merck Index (Martha Windholz, ed.), 10 ed., 71 87 (p- 
phenylphenol) Merck & Co. Inc. New Jersey, USA). Thus, the invention also 
provides an environmentally benign route to this useful compound. 
Additionally, the polypeptides-of the invention and the host cells of the invention 
may be useful for carrying out bioremediation. 

10 

Example 3. (+)-c&-(LR,2S)-Dihydroxy-l,2-dihydronaphthaIene 

c/j-Dihydroxy-l,2-dihydronaphthalene formed by naphthalene 
dioxygenase was purified for chiral HPLC analysis by preparative-layer 
chromatography (A.M. Jeffrey, et al. Biochemistry, 1975, 14, 575-583; S.M. 

15 Resnick, S. Appl Environ. Microbiol, 1994, 60, 3323-3328). Chiral stationary- 
phase liquid chromatography (CSP-HPLC) was used to resolve the two 
enantiomers of cw-naphthalene dihydrodiol with a Chirocel OJ column (Chiral 
Technologies, Exton, PA) as described by S.M. Resnick, S. Appl Environ. 
Microbiol., 1994, 60, 3323-3328. Under these conditions, the (+)-( \R,2S) and (- 

20 )-(\S£M) enantiomers of cw-naphthalene dihydrodiol eluted with retention times 
of 30 and 33 minutes, respectively. 

Example 4. (-)<&-Biphenyl-3,4-dihydrodioL 

A 6 L culture of JM109(DE3)(pDTG141-F352V) was grown in a 
25 10 L Biostat B fermentor (B.Braun Biotech International, Melsungen, Germany) 
in MSB at 27 °C Automated addition of NH 4 OH was used to maintain the pH 
at 7.3, and a slow glucose feed was used to maintain dissolved 0 2 concentration 
at approximately 25% saturation. The culture was induced for 3 hours with 150 
\M DPTG when the optical density of the culture (660 nm) reached 0.8. After 17 
30 hours incubation with 0,025% (w/v) biphenyl, cells were harvested by 

centrifugation and the culture supernatant was extracted with ethyl acetate and 
concentrated as described previously (S.M. Resnick, et al., FEMS Microbiol 
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LetL t 1993, 113, 297-302). Two purification methods were employed: 1) 
multiple elution preparative thin layer chromatography with a 95:5 mixture of 
chloroform and acetone as eluting solvent as previously described (D.S. Torok, 
et a!., /, BacterioL, 199S y 177, 5799-5805); and 2) radial dispersion 
5 chromatography eluting with a step gradient of chloroform-methanol (S.M. 
Resnick, S. AppL Environ, Microbiol, 1994, 60, 3323-3328). 

Over 150 mg of biphenyl 3,4-dihydrodiol was obtained from 500 
mg crude product. The purity of the compound was determined by thin layer 
chromatography, The positions of the hydroxyl groups on the aromatic ring were 

10 determined by *H NMR analysis. The enantiomeric purity was determined to be 
>97% by subjecting the corresponding phenyl-boronate derivative to gas- 
chromatography-mass spectrometry analysis (see S.M. Resnick et al., /. Org, 
Chem., 1995, 60, 3546-3549). Physical properties of the compound were as 
follows: Amax (in methanol) 276 and 228 nm, e 2 7$=4,336 ± 574 and 

1 5 € 2 28= 1 8,580 ± 1621; calculated mass of the phenyl boronate derivative 
(C 18 H 15 0 2 B) was 274.1 165, found mass, 274.1 160; [cc] D -37.5 ±3.8 (c=0.5 
g/100 mL, methanol) as determined using a Jasco P1020 polarimeter with a Na 
589 nm lamp. 

Using a procedure similar to that described in Example 3, cis- 
20 biphenyl 2,3-dihydrodiol and ris-biphenyl 3,4-dihydrodiol were separated 
following multiple elution preparative thin layer chromatography. Under the 
same chiral HPLC conditions used to separate enantiomers of m-naphthalene 
dihydrodiols, the (+)- and (-)- enantiomers of m-biphenyl 3,4-dihydrodiol eluted 
with retention times of 3 1 and 28 minutes, respectively. The (+)-(2i?,3£) and (-)- 
25 (2S,3R) enantiomers of ds-biphenyl 2,3-dihydrodiol eluted with retention times 
of 28 and 33 min, respectively. 

Example 4-HydroxybiphenyI 

Acid-catalyzed dehydration of biphenyl 3,4-dihydrodiol gave 
30 predominantly 4-hydroxybiphenyl. The addition of 50 mM HC1 (final 

concentration) to a 0.3 mM solution of biphenyl 3,4-dihydrodiol in methanol at 
room temperature gave complete dehydration in minutes. 
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Example 6 . Mutations in NDO 

Bacterial strains and plasmids. Bacterial strains and plasmids 
used in this study are listed in Table 4, Escherichia coli strains DH5a and 
5 JM 109(DE3) were used for subcloning and gene expression experiments, 

respectively. Competent E. coli strains ES 1301 and JM109 were purchased from 
Promega Corp., Madison, Wis. and used in the site-directed mutagenesis 
procedure described below. * 

Media and growth conditions. E. coli strains were grown at 

10 37°C in Luria-Bertani (LB) medium (12), or Terrific Broth (TB) medium. 
Antibiotics were added to the following final concentrations as appropriate: 
ampicillin, 150 ^g/ml; tetracycline, 12.5 ^g/mL To produce induced cells for 
biotransformation studies, JM109(DE3) strains carrying plasmids of interest 
were grown at room temperature 30°C in minimal salts medium (MSB) 

15 containing 10 mM glucose, 0.1 mM thiamine, and ampicillin. Isopropyl-p-D- 
thiogalactopyranoside (IPTG) was added to a final concentration of 100 yM 
when culture turbidity reached 0.6-0.8 at 660 nm. After a 2 hour induction, 
biotransformations were initiated as described below. For plates, MSB was 
solidified with 1.8% Noble Agar (Difco Laboratories, Detroit, Mich.) and LB 

20 was solidified with 1,5% Bactoagar (Difco Laboratories). 

Molecular techniques* Plasmid DNA was isolated as described 
previously (S-Y Lee, S. Rasheed, Biotechniques, 1990, 9, 676-679) or by using 
the Qiagen Midi Kit (Qiagen, Inc., Chatsworth, Calif.). For nucleotide 
sequencing, DNA was further purified using a Centricon-100 filter unit (Amicon, 

25 Inc., Beverly, Mass.). Restriction digests were performed as suggested by the 
enzyme suppliers (New England Biolabs, Inc., Beverly, Mass.; Promega Corp., 
Madison, Wis.). DNA fragments were purified from gel slices using the 
GeneClean Spin Kit according to the manufacturer's instructions (BIOl 01, Vista, 
Calif.). Ligation reactions, transformation of £. coli strains and agarose gel 

30 electrophoresis were performed by standard procedures. 
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Site-directed mutagenesis. Mutagenesis of nahAc was carried 
out with the Altered Sites II in vitro Mutagenesis System according to the 
manufacturer's instructions (Promega Corp., Madison, Wis*). Plasmid 
pM ASTER- i (R.E. Paraies et al., /. BacterioL, 1999, 181, 1831-1837), which 
5 contains the 3' end of the nahAc gene and the complete nahAd gene (which 
encode the a and p subunits of NDO, respectively), was used as the template for 
mutagenesis. Each mutagenic oligonucleotide was designed with a silent 
mutation that altered the restriction pattern of the plasmid (Table 5) to facilitate 
screening for clones carrying the desired mutation. Phosphorylated 

1 0 oligonucleotides used for mutagenesis were synthesized by Genosys 

Biotechnologies Inc., Midland, Tex. The nucleotide sequences of both strands of 
the entire insertion in pM ASTER- 1 were determined for each mutant. 
Fluorescent automated DNA sequencing was carried out by the University of 
Iowa DNA Facility using an Applied Biosystems 373A automated DNA 

15 sequencer. 

After verification of each mutation by restriction digestion and 
sequence analysis, the 1.5-kb KpnI-Xbal fragments carrying each mutation were 
individually cloned into KpriL-Xbal-digested pDTG141 . After this subcloning 
step, the presence of each mutation was verified by restriction and sequence 

20 analyses. The resulting derivatives of pDTG141 were introduced into 

JM109(DE3) for expression studies and in this way each protein isoform was 
produced from an identical expression system. 

Whole cell biotransformations. Induced £. coli cultures (50 ml) 
were supplemented with 20 mM glucose and 80 raM phosphate buffer (pH 7.2). 

25 Solid substrates (naphthalene, biphenyl, or phenanthrene) were added to a final 
concentration of 0.025% (w/v). Cultures were incubated at 30 °C with shaking 
(250 rpm) for 15-18 h. To obtain cells for large scale biotransformations to 
produce cw-biphenyl 3,4-dihydrodiol, JM109(DE3)(pDTG141-F352V) was 
grown at 27 °C in MSB containing glucose, thiamine, and ampicillin in a 10 L 

30 Biostat B fermentor (B. Braun Biotech International, Melsungen, Germany). 
Automated addition of NH 4 OH was used to maintain the pH at 7.3, and a slow 
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glucose feed rate was used to maintain the dissolved 0 2 concentration at 
approximately 25% saturation. 

Cultures were induced for 3 hours with 150 pM IFTG when the 
optical density of the culture (660 nm) reached approximately 0.7. Induced 
5 cultures (5.5 L)were incubated at 27°C for 14-17 h with 0.025% (w/w) substrate 
(biphenyl or phenanthrene), high agitation (700 rpm), automated pH control (pH 
7.5) and a slow glucose feed. 

Indigo formation, JM109(DE3) strains carrying pDTGHl 
derivatives with the various mutations were grown overnight at 37 °C on 
1 0 nitrocellulose filters placed on MSB agar plates containing 10 mM glucose, 

1 mM thiamine, and 150 pg/ml ampicillin. Dried Whatman #1 filter papers that 
had been soaked in a 10% solution of indole dissolved in acetone were placed in 
the Petri dish covers after colony formation. Production of indigo from indole 
vapor by NDO was observed as colonies turned blue. No induction was carried 
1 5 out for these studies. 

Separation and identification of products. Culture 
supernatants from whole cell biotransformation experiments were extracted with 
sodium hydroxide-washed ethyl acetate and analyzed by thin layer 
chromatography (TLC). Phenyl boronic acid (PBA) derivatives (A.B. Herbert, 
20 European Patent EP 0379300A2) were prepared as previously described (S,M. 
Resnick, D.T. Gibson, Appl Environ. Microbiol 1996, 62, 4073-4080). PBA- 
derivatized extracts were analyzed by gas chromatography-mass spectrometry 
(GC-MS) as previously described (S,M. Resnick, D.T. Gibson, Appl Environ. 
Microbiol 1996, 62, 3355-3359). ds-Naphthalene dihydrodiol was purified by 
25 preparative-layer chromatography (PLC) with chloroform-acetone (8:2) (S.M. 
Resnick, et al., Appl Environ. Microbiol 1994, 60, 3323-3328). 
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Regioisomers of biphenyl dihydrodiol were separated by PLC (1.0 or 
2.0 mm thickness; E. Merck Industries, Inc., Gibbstown, N.J.) using multiple 
elution (3-4 developments) with chloroform-acetone (9:1). ds-Biphenyl 3,4- 
dihydrodiol was also purified by radial-dispersion chromatography (RDC) using 
5 a Chromatotron (Harrison Research, Palo Also, Calif.), Extracts in chloroform 
containing 0. 1% triethylamine were applied to 2.0 mm-thick silica plates and 
eluted at a flow rate of 7 ml/min with a chioroform-acetone step gradient (0 to 
15% acetone in 3% steps over.l h; 0.1% triethylamine was present at each step). 
Fractions (8 ml) were analyzed by TLC, and those containing cw-biphenyi 3,4- 

1 0 dihydrodiol were combined and concentrated at 35 °C under reduced pressure. 

Chiral stationary-phase HPLC was used to resolve the enantiomers 
of ct>naphthalene dihydrodiol A Chiralcel OJ column (Chiral Technologies, 
Exton, Pa,) was used as described previously (S.ML Resnick, et al., AppL 
Environ. Microbiol. 1994, 60, 3323-3328). Under these conditions, the (+)- 

15 (1 R,2S)~ and (-)-( 1 5,2/?)- enantiomers of c/Vnaphthalene dihydrodiol eluted with 
retention times of 30 and 33 minutes, respectively. Proton ( ! H) nuclear magnetic 
resonance (NMR) spectra were acquired on the Varian UNHY-5QG 500 MHz 
spectrometer in the College of Medicine NMR Facility at the University of Iowa. 
All spectra were obtained using an 8 second relaxation delay, a 5 second 

20 acquisition time, a spectral width of 12 ppm and a 90 degree pulse width of 6.6 
microseconds. Samples were prepared as described by S.M. Resnick, et al., 
Appl. Environ. Microbiol. 1994, 60, 3323-3328). Optical rotations were 
determined at 25 °C using a Jasco PI 020 polarimeter with a 589 nm Na lamp. 
The results are the average of rotations given by three independently purified cis- 

25 biphenyl 3,4-dihydrodiol samples. High resolution mass spectra were recorded 
(by Dr. Lynn Teesch, HR-MS facility, The University of Iowa) on a VG ZAB- 
HF mass spectrometer equipped with direct inlet probe. Absorbance spectra 
(200 to 350 nm) were recorded on a Beckman DU-70 spectrophotometer. 

Chemicals, Naphthalene was obtained from Fisher Scientific Co,, 

30 Pittsburg, Penn. Indole, biphenyl, phenanthrene, and 4-hydroxybiphenyl were 
purchased from Aldrich Chemical Co., Milwaukee, Wis. Synthetic (+/-)-cw- 
naphthalene dihydrodiol and homochiral (-f )-cis-naphthalene dihydrodiol were 
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prepared as previously described (A.H. Jaffrey et al., J. Org. Chem., 1974, 39, 
1405-1407; S.M. Resnick and D.T. Gibson, Biodegredations, 1993, 4, 195-203). 
Synthetic cw-phenanthrene 9,10-dihydrodiol was provided by Dr. Derek Boyd. 
Construction and preliminary analysis of modified NDO 

5 proteins. Nine positions near the active site in the a subunit of NDO were 
chosen for site-directed mutagenesis. Based on the crystal structure of NDO, 
Asn-201, Phe-202, Val-260, Trp-316, Phe-352, Trp-358 and Met-366 are located 
near enough to the mononuclear iron to interact with substrates in the active site. 
Asn-201 is positioned too far from the iron atom to be a ligand in the crystallized 

10 form of NDO, but was suggested as a possible ligand during some stage of the 
catalytic cycle. According to the NDO structure, Asp-362 is one of three amino 
acids that coordinate the iron at the active site. Asp-362 was replaced by alanine 
in order to disrupt iron coordination. Amino acid substitutions were also made 
at Thr-351, since the corresponding amino acid has be shown to be critical in 

1 5 determining polychlorinated biphenyl (PCB) congener specificity in biphenyl 
dioxygenase. 

Site-directed mutations made in the a subunit of NDO are shown in 
Table 5. In most cases, small hydrophobic amino acids (alanine, valine, and 
leucine) were substituted for larger hydrophobic amino acids such as 

20 phenylalanine and tryptophan in order to change the size and/or shape of the 

active site pocket. In some cases, amino acid substitutions were chosen based on 
alignments of various related dioxygenase sequences. 

Indigo formation was used as an initial screen for NDO activity. 
Freshly grown cells of JM109(DE3) carrying modified pDTG141 plasmids were 

25 incubated in the presence of indole. Most strains carrying mutant NDO enzymes 
formed blue colonies in the presence of indole. Strains producing NDO 
isoforms F202L and D362A formed white colonies, suggesting that either these 
enzymes were inactive or that indole was not a substrate for the modified 
enzymes. The strain carrying the W358 A substitution in NDO formed pale blue 

30 colonies upon extended incubation with indole, indicating very weak activity 
with indole as substrate. 
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Production of mutant NDO a subunits. Formation of mutant a 
subunits was verified in Western blots using whole-cell protein samples from 
induced JM109(DE3) carrying modified pDTG 141 plasmids. A monoclonal 
antibody specific for the a subunit of NDO was used R.E. Paraies, et al., /. 
5 Bacterids 1998, 180, 2337-2344). Results show that all mutant constructs 
formed full length a subunits and there were minor variations in the amounts of 
each mutant protein produced. More importantly, these show results 
demonstrate that the inability of isoforms D362A and F202L to produce products 
was not due to the absence of protein. 

10 Biotransformations with naphthalene as substrate* Wild-type 

NDO converts naphthalene to c^-naphthalene 1,2-dihydrodioL 
Biotransformations with naphthalene resulted in the formation of as-naphthalene 
1,2-dihydrodiol by all NDO isoforms with the exception of F202L and D362A, 
which formed no product. W358A transformations were very inefficient, with 

15 less than 5% of the substrate transformed within 1 5 h as judged by GC-MS 
analysis of extracted culture supernatants. The wild-type and all mutant NDO 
enzymes formed enantiomerically pure (>99%) (+>(1J?,25)- m-naphthalene 
dihydrodiol except for those with amino acid substitutions at Phe-352. Isoforms 
F352V and F352L formed 92% and 96% (+)-(\R,2S)- ds-naphthalene 

20 dihydrodiol, respectively. This result show the importance of a specific amino 
acid, Phe-352, in determining the enantioselectivity of NDO. 

Biotransformations with biphenyl as substrate. Wild-type NDO 
oxidized biphenyl to two metabolites which were detected by TLC. The major 
metabolite (R f , 0.2) and the minor metabolite (R / , 0. 18) dehydrated to phenolic 

25 products (M + , 170) when analyzed by GC-MS. These results suggested that both 
metabolites were dihydrodiol isomers and this was confirmed by GC-MS of their 
stable respective phenyl boronic acid derivatives which gave molecular ions at 
m/e 274. The major metabolite (87% relative yield) had a retention time of 13.8 
min and was identical to ris-2,3-dihydroxy-l-phenylcyclohexa-4,6-diene (cis- 

30 biphenyl 2,3-dihydrodiol) produced from biphenyl by Sphingomonas yanoikuyae 
B8/36 (formerly Beijerinckia sp. strain B8/36) (D.T. Gibson et al., Biochem. 
Biophys. Res. Commun. 1973, 50, 21 1-219). The minor product (13% relative 
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yield) had a retention time of 14,2 min and was identified as c/.s-3,4-dihydroxy- 
l-phenylcyclohexa-l,5-diene (cis-b\ phenyl 3,4-dihydrodiol; see below). 

Isoforms F202L and D362A formed no products from biphenyl, 
Isoforms N201 A and W358A formed only a trace amounts of m-biphenyl 2,3- 
5 dihydrodiol. Amino acid substitutions at N201 , F202, V260, W3 16, and T35 1 
had slight effects on the regiospecificity of NDO as seen by the product 
distributions shown in Figure 1. However, both NDO isoforms with changes at 
position 352 formed m-biphenyl 3,4-dihydrodiol as the major product. The 
isoform with the largest specificity change, F352V, formed 96% cw-biphenyl 

10 3,4-dihydrodiol, 

Identification and characterization of m-biphenyl 3,4- 
dihydrodioh The second product ran slightly slower than cw-biphenyl 2,3- 
dihydrodiol on TLC plates and, when analyzed by GC-MS as its PBA derivative, 
had a retention time of 14,2 min compared to 13.8 min for the PBA derivative of 

15 ds-biphenyl 2,3-dihydrodiol. The products formed from biphenyl by the F352V 
isoform were isolated by RDC. Approximately 140 mg of crude extract was 
applied to a 2.0 mm-thick silica chromatotron plate and eluted as described in the 
Materials and Methods to allow isolation of 40-60 mg pure m-biphenyl 3,4- 
dihydrodiol and 1-2 mg of c/s-biphenyl 2,3-dihydrodiol (fractions eluting before 

20 the 3,4-diol). The 3,4-regiochemistry of the diol and *H NMR shift assignments 
were established by chemical shift multiplicities and independent H-H 
decoupling experiments. The 3,4-regiochemistry of the dihydrodiol was 
apparent by decoupling at H-3 (4.31 ppm) which reduced the multiplicity of the 
H-2 signal (ddd, 6.16) to a singlet with fine splitting. Acid dehydration resulted 

25 in the formation of a product that coeluted with authentic 4-hydroxybipheny] in 
TLC and GC-MS analyses. 

Physical characteristics of the cw-biphenyl 3,4-dihydrodiol were as 
follows: k max [MeOH], 204, 228, and 276 nm, = 1 1,860, e 22B = 18,580, and 
6 276 = 4,336 M" l cnr ! ; calculated mass for the phenylboronate derivative, 

30 "Cjg'Hj^O^B, 274.1 160, found mass 274.1165; mass spectrum of phenyl 
boronate derivative m/z (relative intensity), 174 (M + , 100), 170 (55), 152 (11), 
142 (84), 1 15 (22), 77 (6); [cc] D -37.5 ± 4.4, n=3 (c 0.5, MeOH); *H NMR 
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(chloroform), 6 4.21 (ddd, J=6.4, 4.0, ! .5 Hz, H-4), 4.3 1 (dd, J=6.4, 4.2 Hz, H- 
3), 6.09 (ddd, J=9.8, 4.0, 0.8 Hz, H-5), 6.16 (ddd, J=4.2, 1.7, 0.7 Hz, H-2), 6.37 
(dt, J=9.9, 1.6 Hz, H-6), 7.30 (tt, I H aromatic-p), 7.37 (m, 2H, aromatic-m), 7.46 
(m, 2H, aromatic-o). 

5 Biotransformations with phenanthrene as substrate. Since many 

of the amino acid substitutions would be predicted to increase the size of the 
NDO active site, a larger substrate, phenanthrene, was tested. Identification of 
the three regioisomers of d^-phenanthrene dihydrodiol were carried out by 
comparing GC-MS data with ds-3,4-dihydroxy-3,4-dihydrophenanthrene (cis- 

1 0 phenanthrene 3 ,4-dihydrodiol) and cis- 1 ,2-dihydroxy- 1 ,2-dihydrophenanthrene 
(ds-phenanthrene 1,2-dihydrodiol) produced by S. yanoikuyae B8/36 (D.M. 
Jerina et aL, /. Am. Chem. Soc. 1976, 98, 5988-5996), and synthetic dr-9,10- 
dihydroxy-9,10-dihydrophenanthrene (m-phenanthrene 9,10-dihydrodiol). The 
PBA derivatives of ds-phenanthrene 9,10-dihydrodiol, m-phenanthrene 3,4- 

1 5 dihydrodiol and cw-phenanthrene 1 ,2-dihydrodiol had GC retention times of 
18.0, 19.1, and 20.2 min, respectively. Wild-type NDO from Pseudomonas sp. 
strain NCIB 9816-4 formed a 9:1 mixture of ds-phenanthrene 3,4-dihydrodioI 
and cw-phenanthrene 1,2-dihydrodiol (Figure 2). These results are similar to 
those obtained with NDO from Pseudomonas sp. strain 1 19 and biphenyl 

20 dioxygenase from S. yanoikuyae B8/36 (D.M. Jerina et aL, /. Am. Chem. Soc. 
1976, P8 t 5988-5996). 

With the exception of isoforms F202L and D362A, all mutant NDO 
enzymes formed products with phenanthrene as substrate. Amino acid 
substitutions at all positions changed product ratios to some extent. Isoforms 

25 V260A and W358A preferentially oxidized phenanthrene at the C3 and C4 
positions, forming almost no ds-phenanthrene 1,2-dihydrodiol (Figure 2). 
Several isoforms, including N201A, N201S, F202V, W316A, T351R, F352V, 
F352L, and M366W produced a significantly greater proportion of cis- 
phenanthrene 1,2-dihydrodiol than did wild-type NDO (Figure 2). Of particular 

30 interest is the result with the F352V isoform. This enzyme had the opposite 
regioselectivity to wild-type NDO, forming 83% cw-phenanthrene 1,2- 
dihydrodiol in contrast to wild type, which formed 90% d?~phenanthrene 3,4- 
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dihydrodiol. The F352L isoform oxidized phenanthrene to a small amount (5% 
of the total product) of m-phenanthrene 9,10-dihydrodiol (Figure 2). 

Table 6 shows the amino acids in related dioxygenases that are 
located at positions corresponding to those mutated in NDO. Some amino acids 
5 listed in Table 6 are conserved in all of the enzymes shown (Phe-202 and Asp- 
362). In other cases, amino acids are not conserved and an amino acid in NDO 
was changed to one present in one of the other enzymes. Some of the NDO 
mutations were chosen based on the identification of amino acids critical for 
determining substrate specificity in other dioxygenases. In NDO, Thr-351, when 

10 changed to Asn, had a minor effect on product formation from phenanthrene. 
Replacement of this amino acid with Arg in NDO had a slight effect on product 
formation from biphenyl and a larger effect when phenanthrene was provided as 
the substrate (Figures 1 and 2). This position corresponds to the important 
amino acid in BPDO^^ Asn-377 that was mentioned above. 

15 Changes at Val-260 in NDO resulted in minor changes in product 

formation with biphenyl and phenanthrene. Substitution of Val for Asn260 in 
2NTDO resulted in an enzyme that no longer oxidized the aromatic ring of 2- 
nitrotoluene, forming only the monooxygenation product 2-nitrobenzylalcohol 
(J.V. Parales and D.T. Gibson, Abstracts of the 99th General Meeting of the 

20 American Society for Microbiology, 1999, Q-249, p-579). The opposite change 
in specificity did not occur with the NDO isoform V260N. Like wild-type NDO, 
the V260N isoform did not oxidize the aromatic ring of 2-nitrotoluene, but 
formed only 2-nitrobenzyI alcohol. 

Toluene dioxygenase, which has aTrp residue at the position 

25 corresponding to 366 in NDO, dihydroxylates the aromatic ring of toluene to 
form cz>toluene dihydrodiol. However, the M366W isoform of NDO oxidized 
toluene to benzyl alcohol, the same product formed by the wild-type enzyme. 
Changing Trp-316 to Ala resulted in a minor change in regioselectivity with 
phenanthrene. Changing this conserved amino acid to Phe in 2NTDO had a 

30 slight effect on the stereochemistry of cw-naphthalene dihydrodiol formed from 
naphthalene (J.V. Parales and D.T. Gibson, Abstracts of the 99th General 
Meeting of the American Society for Microbiology, 1999, Q-249, p-579). 
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The amino acid at position 352 appears to play an important role in 
controlling both the stereochemistry of m-naphthalene dihydrodiol formed from 
naphthalene, as'well as the regioselectivity with substrates such as biphenyl and 
phenanthrene. In addition, a product that is not made by wild-type NDO, cis- 
5 phenanthrene 9, 10-dihydrodioi, was formed from phenanthrene by the F352L 
isoform. 

To compare substrate specificities of NDO and the new NDO 
isoforms with those of the closely related enzymes 2NTDO and DNTDO, 
biotransformations were carried out with biphenyl and phenanthrene. Both were 

1 0 found to be poor substrates for 2NTDO and DNTDO. Both enzymes made a 
trace amount of cw-biphenyl 2,3-dihydrodiol from biphenyl, and DNTDO made 
a trace amount of phenanthrene 3,4-dihydrodiol from phenanthrene. It is not 
clear at this time why biphenyl and phenanthrene are such poor substrates for 
2NTDO and DNTDO. 

15 In NDO, Asp-205 is located between the two redox centers at the 

junction of two adjacent a subunits. Substitution of Asp-205 by glutamine 
resulted in an isoform of NDO with no activity (R.E. Parales, et al. J. BacterioL, 
1999, 181, 1831-1837). In the glutamine-containing enzyme, electron transfer 
between the Rieske center and the mononuclear iron was shown to be blocked, 

20 indicating that Asp-205 is essential for this electron transfer step to occur (RE. 
Parales, et al. I BacterioL, 1999, 181, 1831-1837), Iron at the active site of 
NDO is coordinated by His-208, His-213, and Asp-362. All three of these 
residues are conserved in the ring-hydroxylating dioxygenases whose sequences 
have been determined to date. 

25 The corresponding histidine residues in toluene dioxygenase, from 

P. putida Fl (His-222 and His 228) were replaced with alanine residues and 
these substitutions resulted in completely inactive enzymes (H. Jiang, et al., / 
BacterioL 1996, 178, 3133-3139). The inability to detect products from four 
different substrates indicates that substitution of Ala at position 362 results in an 

30 inactive form of NDO. No activity was detected in crude cell extracts of the 
D362A isoform with either oxygen uptake assays or product formation assays 
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with u C-naphthalene. These results are consistent with the identification of Asp- 
362 as a ligand to the mononuclear iron at the active site. 

Asn-201, a possible fourth iron-coordinating amino acid was 
observed in the crystal structure of NDO. This residue was too far from the iron 
5 atom to serve as a ligand in the crystallized form of the enzyme, but was 

suggested as a possible ligand during a step in the catalytic cycle (B. Kauppi, et 
aL, Structure, 1998, d, 571-586). Amino acid substitutions at Asn-201 resulted 
in enzymes with reduced but significant activity, indicating that this residue does 
not participate in the coordination of iron at the active site. Crude cell extracts of 

10 the N201A and N201Q isoforms had 5-10% of the activity of wild-type NDO. 
Results presented in Figures 1 and 2 suggest that Asn-201 may play a minor 
role in determining regioselectivity with biphenyl and phenanthrene as 
substrates. However, Asn-201 may be more important for maintaining 
appropriate interactions between a subunits through its hydrogen bond with Tyr- 

15 103 near the Rieske center in an adjacent a subunit (B. Kauppi, et aL, Structure, 
1998, 6, 571-586). Substitution of an alanine at Asn-201 would disrupt this 
hydrogen bond and could affect the flow of electrons from the Rieske center to 
the mononuclear iron, thus reducing enzyme activity. The incorporation of the 
larger Gin residue at this position may prevent the normal interaction of a 

20 subunits even though Gin would be capable of forming a hydrogen bond with 
Tyr-103. The N201S isoform showed 35-40% of the wild-type NDO activity, 
indicating that serine is a reasonably good substitute for Asn at this position 
(B.V. Plapp, Methods Enzymol, 1995, 249, 91-1 19). 

Of the three substrates tested, the most significant effects of 

25 mutations at the active site were observed with the largest substrate, 

phenanthrene. This is not an unexpected result since the substrate pocket is of 
limited size and larger substrates are likely to come in contact with more amino 
acids in the active site. Many of the mutations involved the substitution of a 
small hydrophobic amino acid for a larger one, and in most cases this type of 

30 substitution did not severely reduce the activity of the enzyme as can sometimes 
occur (M.S. Caffrey, Biochimie, 1994, 76, 622-630). However, one exception 
was the substitution of Trp-358 by Ala, which resulted in an enzyme with very 
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poor activity with naphthalene and biphenyl, but somewhat better activity with 
the larger substrate phenanthrene. 

Another exception was isoform F202L, which failed to form 
products with all substrates tested. The reason that this substitution resulted in 
5 an inactive enzyme while isoform F202V had good activity is not understood. 
Somewhat surprisingly, mutations that introduced changes in polarity or charge 
(V260N, T35 1R) resulted in enzymes with good activity toward hydrophobic 
substrates. In general, most changes at the active site, with the exception of 
those that affect iron binding (Asp-362) and electron transfer (Asp-205) were 
10 tolerated well, suggesting that there is significant flexibility in the range of 
amino acids that can be introduced at the active site. This suggests that 
oxygenases with novel catalytic capabilities can be generated by introducing 
single or multiple mutations near the active site. 

1 5 EXAMPLE 7 Other Mutants Related To SEQ ID NO:25 

Using procedures similar to those described in Example 1 (as 
described below), NDO mutant genes encoding glycine, alanine, threonine, 
leucine, isoleucine, tryptophane, or tyrosine instead of phenylalanine at position 
352 were also prepared Table 7 shows the SEQ ID No's for these DNA 
20 sequences and for proteins they encode. 
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Table 7 



Amino Acid At 


SEQ ID NO: 


SEQ ID NO: For 


Position 352 


For Modified 


Corresponding 




Sequence 


Polypeptide 


glycine 


SEQE>NO:27 


SEQIDNO:32 


alanine 


SEQ ID NO:28 


SEQIDNO.-33 


threonine 


SEQE>NO:29 


SEQIDNO:34 


leucine 


SEQIDNO:30 


SEQIDNO.-35 


isoleucine 


SEQtt>NO:31 


SEQE)NO:36 


tyrptophane 


SEQ ID NO:56 


SEQ ID N0. 58 


tyrosine 


SEQIDNO:57 


SEQIDNO:59 



Activity of modified NDO proteins. The formation of indigo from 
indole was used to screen for NDO activity. Freshly grown colonies 
JM109(DE3) carrying modified pDTG141 plasmids were incubated in the 
15 presence of indole. Strains producing NDO enzymes with the mutations F352W 
and F352Y formed white colonies, suggesting that these enzymes were inactive 
or that indole was no longer a substrate for the modified enzymes. All other 
NDO isoforms constructed in this study appeared to be active. Table 8 shows 
the substitutions in the a subunit of NDO generated by site-directed mutagenesis. 

20 



Mutation 


Mutagenic Oligonucleotide 


SEQ ID 

NO: 


Indigo 
Formation 


F352G 


5 , -GTrCAGCGAACG£^SCGGGCCTGCTGG-3 , 


60 


+ 


F352A 


S'-GrrCAGCGAACQfitCCGGGCCTGCrGG-S' 


61 


+ 


F352T 


S'-GTCCAGCGAACGACCGGGCCTGCTGG-? 


62 




F352I 


5-GTTCAGCGAACQAICGGGCCTGCTGG-3 1 


63 


+ 


F352L 


S'-TTCAGCGAACaCXCGGGCCTGC-S' 


52 


+ 


F352W 


S-GTrCAGCGAACQIfiGGGGCCTGCrGG-S' 


64 




F352Y 


5-TTCAGCGAACQ3ACGGGCCTGCTGG-3* 


65 





30 
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Underlined bases in Table 8 indicate the position of the eliminated restriction 
site, Ac/L BAase changes are in bold. Indigo formation was monitored after 
8 hours; (+) indicates pale blue colonies, (-) indicates no blue color. 

Regioselectivity of modified NDO proteins. Biotransformations 

5 with naphthalene resulted in the formation of m-l,2-dihydroxy-l,2- 

dihydronaphthalene (m-naphthalene dihydrodiol) by all NDO isoforms with 
substitutions at position 352 except F352Y, which formed no product. NDO- 
F352W transformations were very inefficient. In contrast to wild-type NDO, all 
enzymes with amino acid substitutions at position 352 formed small amounts of 

1 0 the (-)-enantiomer of m-naphthalene dihydrodiol from naphthalene as 
determined from chiral HPLC analysis (Table 9). 



Table 9 



NDO Enzyme 


cis-Naphthalene 
1,2-dihydrodiol 


cis-Biphenyl 2,3- 
dihydrodiol 


cis-Biphenyl 3,4- 
dihydrodiol 


NDO (wild type) 


>99% (+)-(lR,2S) 


>95% (+)-(2R,3S) 


>98% (+)-(3R,4S) 


P352G 


98% (+)-(lR,2S) 


>95% (+H2R.3S) 


60% (+)-(3R,4S) 


F352A 


96%(+)-(lR t 2S) 


>95% <+H2R,3S) 


65% (+H3R.4S) 


F352T 


93% (+)-(lR,2S) 


>95% (+M2R,3S) 


60% (-)-(3S,4R) 


F352V 


92%(+)-(lR,2S) 




77% (-M3S.4R) 


F352I 


94% (+M1R.2S) 


>95% (+H2R.3S) 


53% (+H3R.4S) 


F352L 


96% (+MHUS) 


>95% (+)-(2R,3S) 


70% (+)-(3R,4S) 



When biphenyl was used as a biotransformation substrate, wild-type 
NDO converted it to an 87:13 mixture of c/>biphenyl 2,3-dihydrodiol and cis- 

25 biphenyl 3,4-dihydrodiol. However, a major change in regioselectivity with 
biphenyl was seen when amino acid substitutions were introduced at F352. All 
active mutant NDO enzymes with changes at this position formed m-biphenyl 
3,4-dihydrodiol as the major product (Table 10). The F352Y isoform formed no 
detectable product from biphenyl, and F352W formed only a trace amount of cis- 

30 biphenyl 2,3-dihydrodiol. 
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Table 10 



5 



Enzyme 


Bi phenyl 
2,3-tlioi 


Biphenyl 
3,4-diol 


phenanthrene 
3,4-diol 


phenanthrene 
1,2-diof 


phenanthrene 
9,10-dio! 


NDO 


87 


13 


90 


10 




F352G 


32 


68 


79 


21 j 




F352A 


23 


77 


53 


47 




F352T 


8 


92 


59 


41 




F352V 


4 


96 . 


17 


83 




F352L 


15 


85 


64 


31 


5 


F352I 


17 


83 


76 


24 





10 

Like wild-type NDO, isoforms F352G, F352A, F352T, F352I and 
F352L formed ds-phenanthrene 3,4-dihydrodiol as the major product from 
phenanthrene, although product ratios varied significantly depending on the 
enzyme (Table 10). The F352V isoform had the opposite regioselectivity, 

1 5 forming primarily (83%) m-phenanthrene 1 ,2-dihydrodiol. Isoforms F352W 
and F352Y did not form detectable amounts of product from phenanthrene* 

The enantiomeric composition of ciVbiphenyl 2,3-dihydrodiol was 
unaffected by amino acid substitutions at this position, but that of the cis~ 
biphenyl 3,4-dihydrodiol was significantly different in all cases from that formed 

20 by wild type (Table 9). Isoforms F352V and F352T formed the opposite 
enantiomer of ds-biphenyl 3,4-dihydrodiol as wild-type NDO (Table 9). 

Absolute stereochemistry of m-biphenyl 3,4-dihydrodioi The 
formation of diastereomeric 2-(l-methoxyethyl)-phenyl boronic acid (MPBA) 
derivatives of the F352V-generated m-biphenyl 3,4-dihydrodiol provided a 

25 means for determining the enantiomeric purity of the compound. The results 
also allow an empirical prediction of absolute configuration based on trends for 
vicinal m-diols with a benzylic hydroxymethine (S.M, Resnick, et al, 5 /♦ Org. 
Chem» 1995, 60, 3546-3549). These trends were employed in the absence of 
MPBA directional shift data for a series of cw-3,4-dihydrodiols of known 

30 absolute configuration. Proton ( 1 H) nuclear magnetic resonance (NMR) spectra 
were acquired on the Bruker AMX-600 MHz NMR spectrometer at 600.14 MHz 
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in the University of Iowa High-Field NMR Facility. All spectra were obtained 
using a 14 s recovery delay, a 4.06 s acquisition time, a spectral width of 13.4 
ppm and a 90 degree pulse width of 7.5 ps. Samples were prepared as previously 
described (S. M. Resnick et al., /. Org. Chem. 9 1995, 60, 3546-3549; S.M. 
5 Resnick et al., Appi Enviorn. Microbiol, 1994, 60, 3323-3328). 'H-NMR 
analysis (d 6 -benzene) showed that the methoxy signal of the derivative formed 
with the (-)-aV3,4-biphenyl dihydrodiol and (S)-MPBA was shifted downfield 
(A5 +21 ppb; 3. 1987 ppm) relative to the corresponding signal of the (fl)-MPBA 
derivative. The enantiomeric purity of the major diol was approximately 75%, 

10 based on integration of the methoxy groups of the major and minor MPBA 
diastereomers. This result confirms the data obtained by chiral HPLC analysis 
(Table 9). The downfield shifted methoxy signal for the (S)-MPBA derivative 
would indicate an S-configuration at the benzylic carbon for a 2,3-dihydrodioL 
Application of this trend to the hydroxymethine nearest the benzylic position 

1 5 allows the suggestion of S-stereochemistry at C-3, and an absolute configuration 
of (-)-m-(3£,4#)-biphenyl dihydrodiol. 

Preparation of enantiomerically pure (-)-cw-(3S»4JR)-biphenyl 
dihydrodiol. Three dihydrodiol dehydrogenases were tested for the ability to 
oxidize the (+)- and Q-enantiomers of cw-biphenyl 2,3- and 3,4- dihydrodiols, 

20 Toluene dihydrodiol dehydrogenase from Pseudomonas putida Fl (J.E. Rogers 
and D.T. Gibson, /. BacterioU 1977, 130, 1 1 17-1 124) was shown to specifically 
attack the (+)-enantiomers of both dihydrodiols at a significant rate. Plasmid 
pDTGSl 1 carries the todD gene encoding toluene dihydrodiol dehydrogenase 
from P. putida Fl (GJ. Zylstra and D.T. Gibson Arromatic Hydrocarbon 

25 Degredation: a molecular approach, p. 183-203, in K. Setlow (ed.) Genetic 
Enginering: Principles and Methods, vol. 13, Plenum Press, New York). A 
bacterial strain was constructed which produces the F352V isoform of NDO and 
toluene dihydrodiol dehydrogenase. When this strain, JM109(DE3)(pDTG141- 
F352V)(pDTG51 1), was used in biotransformations with biphenyl, the only 

30 dihydrodiol detected by chiral HPLC anaylsis was (-)-c?>biphenyl 3,4- 
dihydrodtol. The (+)-enantiomers of m-biphenyl 2,3-dihydrodiol and cis- 
biphenyl 3,4-dihydrodiol were completely converted to the respective catechols, 



43 



compounds which were easily separated from the dihydrodiol by preparative thin 
layer chromatography. 

Absolute stereochemistry of m-phenanthrene 3,4-dihydrodiol 
and 1,2-dihydrodiol. The formation of diastereomeric MPBA derivatives of the 
5 F352V-generated mixture of m-phenanthrene dihydrodiols provided a means for 
determining the enantiomeric purity of the compounds. The results also allow an 
empirical prediction of absolute configuration based on trends for vicinal cis- 
diols with a benzylic hydroxymethine (S. M. Resnicket aL, /. Org, Chem., 1995, 
60, 3546-3549). Samples were prepared as previously described (S, M. Resnick 

10 et aL, / Org. Chem., 1995, 60, 3546-3549; S.M. Resnick et al., Appl Enviorn. 
Microbiol, 1994, 60, 3323-3328). Proton (*H) nuclear magnetic resonance 
(NMR) spectra were acquired as described above. An upfield directional shift 
for the methoxy signal of many (5)-MPBA deriviatives of polyaromatic cm- 1,2- 
dihydrodiols is indicative of an /{-configuration at the benzylic position of the 

15 ris-diol. For the cM-phenanthrene 1,2-dihydrodiol, the methoxy signal of the (S)- 
MPBA deriviative was +72 ppb downfield from the corresponding signal of the 
opposite diastereomer formed with (i?)-MPB A and predicting an S-configuration 
at the benzylic center. 

Based on trends previously documented, the absolute configuration 

20 of the major dihydrodiol formed by F352V from phenanthrene is cis-(lS,2R)- 
phenanthrene dihydrodiol (91% e.e., approx. 83% relative yield). The facial 
selectivity in this case was the opposite to that shown for wild type biphenyl 
dioxygenase from Sphingomonas yanoikuyae B8/36. Analysis of the (±)-MPBA 
derivative of the isolated cM-phenanthrene dihydrodiol fraction formed by B8/36 

25 showed resolution of the mixed racemates (of 3,4- and 1,2-diols) with minor 
methoxy signals of the 1,2-diol at 3*148 and 3.220 ppm. The same sample 
derivatized with (5)-MPBA showed the upfield shift for the methoxy signal at 
3.148 ppm which corresponds to the an ^-configuration of the benzylic center 
consistent with and confirmed by the previously determined (1J?,25)- 

30 configuration (M. Koreeda et al., /. Org. Chem,, 1978, 43, 1023-1027). The 
results of the above stereochemical correlation also suggest that the empirical 
application of the trends in the directional shifts of polycyclic aromatic diols 
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appears to be valid for both the "bay-region'* cis-3,4- and "non~bay region" cis- 
1 ,2- dihydrodiols of phenanthrene. 

The minor diol formed from phenanthrene by F352V was identified 
as a>(3£,4/?)-phenanthrene dihydrodiol (>95% e.e., 17% relative yield). This 
5 assignment is based on the correlation of the methoxy signal at 3. 1 1 5 ppm (but 
not 3.241 ppm) in the (S)-MPBA derivative of the F352V minor phenanthrene 
3,4-dihydrodiol with that of the identical directional shifts of the known B8/36 
c/s-phenanthrene 3,4-dihydrodiol derivatives. 

Relative activities of the mutant NDO enzymes. Cultures (50 ml 
10 in 500 ml flasks) were grown and induced, and biotransformations with 

naphthalene or biphenyl were initiated as described previously. Samples (1 ml 
each) were taken at 30 minute intervals over a period of 5 hours. Cells were 
removed by centrifugation and pellets were stored at -20° C for protein 
determinations, c/^-Naphthalene dihydrodiol formation was monitored at 262 

15 nm (e_= 8 1 14 M^cm' 1 ). m-Biphenyl 2,3-dihydrodiol formation was monitored 
at 303 nm (e^= 13,600 M ml cm l ; (2)). c/j-Biphenyl 3,4-dihydrodiol formation 
was monitored at 276 nm (e_~ 4340 Nf'cin" 1 ; (5)) using a correction for the 
absorbance of m-biphenyl 2,3-dihydrodiol at this wavelength. The extinction 
coefficient of ra-biphenyl 2,3-dihydrodiol at 276 nm (the Amax of of cis- 

20 biphenyl 3,4-dihydrodiol) was determined to be 7950 M^cm" 1 using purified cis- 
biphenyl 2,3-dihydrodiol from S. yanoikuyae B8/36 (2). The concentration of 
m-biphenyl 3,4-dihydrodiol was calculated using the ratios of products formed 
by each mutant enzyme and subtracting the contribution of w-biphenyl 2,3- 
dihydrodiol. Absorbance of the negative control strain (JM109(DE3)(pT7-5) 

25 was subtracted at each time point. Protein concentrations were determined by 
the method of Bradford (M.M. Bradford, Anal Biochem. 1976, 72, 248-254) 
after boiling cell pellets for 1 h in 0.1 N NaOH. Bovine serum albumin was used 
as the standard. Rates reported are the averages of three independent 
experiments. 

30 The in vivo rates of formation of cr.y-naphthalene dihydrodiol by 

wild-type and mutant NDO enzymes are shown in Table 11. 
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Tab le U 

Naphthalene Biphenyl 2,3-dihydrodiol Biphenyl 3,4dihydrodiol 

dihydrodiol formation formmation formation 



NDO 


Specific 


Relative 


Specific 


Relative 


Specific 


Relative 


Enzyme 


Activity 


Activity 


Activity 


Activity 


Activity 


Activity 




(nmol/mi 


{%) 


(nmol/m 


(7o) 


jnmoi/m 


(70) 




n/iiig; 








\t\fmci\ 
ill/ 1 1 IK/ 




NDO 


20.4 


100 


4.81 


100 


0.95 


100 


F352G 


7.4 


37 


<0.05 


<1 


<0.25 


<26 


F352A 


9.9 


49 


0.07 


1 


0.25 


26 


F352T 


15.6 


77 


0.14 


3 


0.73 


77 


F352V 


16.9 


83 


0.15 


3 


0.94 


99 


F352I 


16.5 


81 


0.16 


3 


0.74 


78 


F352L 


19.5 


96 


0.21 


4 


0.82 


86 



1 5 The F352L isoform produced m-naphthalene dihydrodiol at wild-type rates, 
while the F352T, F352V, and F352I isoforms were slightly less efficient, with 
rates 75-85% that of wild-type NDO. The F352G and F352A enzymes were the 
least efficient in catalyzing this reaction. A similar trend is seen in the rates of 
formation of cw-biphenyl 3,4-dihydrodiol from biphenyl by the enzymes with 

20 substitutions at postion 352 (Table 11). The F352T, V, I and L isoforms formed 
product at slightly reduced rates compared to wild-type NDO, while F352A was 
significantly slower and F352G rates were not measurable. In contrast, all 
enzymes with substitutions at position 352 were severely defective in forming 
ctVbiphenyl 2,3-dihydrodiol from biphenyl (Table 11). These studies 

25 demonstrate that the amino acid substitutions at position 352 result in enzymes 
with a decreased tendency to oxidize at the 2,3- position of biphenyl However, 
the rate of oxidation at the 3,4- position of biphenyl was not improved in the 
mutant enzymes. 
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All publications, patents, and patent documents are incorporated 
by reference herein, as though individually incorporated by reference. The 
invention has been described with reference to various specific and preferred 
embodiments and techniques. However, it should be understood that many 
variations and modifications may be made while remaining within the spirit and 
scope of the invention. 
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